Journal of Artificial Intelligence Research 18 (2003) 315-349 



Submitted 10/02; published 4/03 



Structure and Complexity in Planning with Unary Operators 



Unary operator domains - i.e., domains in which operators have a single effect - arise 
naturally in many control problems. In its most general form, the problem of STRIPS plan- 
ning in unary operator domains is known to be as hard as the general STRIPS planning 
problem - both arc PSPACE-complete. However, unary operator domains induce a natural 
structure, called the domain's causal graph. This graph relates between the preconditions 
and effect of each domain operator. Causal graphs were exploited by Williams and Nayak 
in order to analyze plan generation for one of the controllers in NASA's Deep-Space One 
spacecraft. There, they utilized the fact that when this graph is acyclic, a serialization 
ordering over any subgoal can be obtained quickly. In this paper we conduct a compre- 
hensive study of the relationship between the structure of a domain's causal graph and the 
complexity of planning in this domain. On the positive side, we show that a non-trivial 
polynomial time plan generation algorithm exists for domains whose causal graph induces 
a polytree with a constant bomid on its node indcgrcc. On the negative side, we show 
that even plan existence is hard when the graph is a directed-path singly connected DAG. 
More generally, we show that the number of paths in the causal graph is closely related to 
the complexity of planning in the associated domain. Finally we relate our results to the 
question of complexity of planning with serializable subgoals. 

1. Introduction 

One of the first well formulated problems addressed by AI researchers was the planning 
problem. Simply stated, it involves the generation of a sequence of system transformations, 
taken out of a given set of system transformations (called actions or plan operators), whose 
combined effect is to move the system from some given initial state into one of a set of 
desired goal states. The planning problem is known to be intractable in general (Chapman, 
1987), and tractable algorithms exist for very restrictive classes of problems only. This 
discouraging fact has not deterred planning researchers. Indeed, many researchers believe 
that real-world problems have some properties, or structure, that could be exploited, either 
implicitly or explicitly. In this paper we attempt to understand the relationship between 
structure and complexity in planning problems in which each action changes the value of a 
single variable. 

To study the relation between the structure and the complexity in a class of problems 
we must identify a set of parameters that characterize it. In the case of planning, a number 

of such problem properties have been studied in the past (which we review in more detail 
in Section 6). These properties have been mostly syntactical, i.e., they involve restriction 
on operators, e.g., the type and number of preconditions or effects that operators have. For 
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example, Bylander (1994) showed that STRIPS planning in domains where each operator is 
restricted to have positive preconditions and one postcondition only is tractable. Backstrom 
and Klein (1991b) considered other, more global types of syntactical restrictions, but using 
a more refined model in which two types of preconditions are considered: prevail conditions, 
which are variable values that are required prior to the execution of the operator and are 
not affected by the operator, and preconditions, which are affected by the operator. For 
example, they have shown that when operators have a single effect, no two operators have 
the same effect, and each variable can be affected only in one context (of prevail conditions) 
then the planning problem can be solved in polynomial time. However, these restrictions 
are very strict, and it is difficult to find reasonable domains satisfying them. 

In this paper we concentrate on more global properties of unary operator domains; 
properties that capture some of the interactions between different planning operators. The 
tool we use to study these properties is the domains' causal graph. A causal graph is a 
directed graph whose nodes stand for the domain propositions. An edge {p, q) appears in 
the causal graph if and only if some operator that changes the vahic of q has a prevail 
condition involving p. Such a problem structure was introduced by Knoblock (1994) in 
the context of automatically generating abstractions for planning. Subsequently, Jonsson 
and Backstrom (1998b) introduced the 3S class of planning problems with unary operators, 
which was characterized by the acyclicity of the causal graph, and some restrictions on the 
operator set. It was shown that determining plan existence for this class of problems is 
polynomial, while plan generation is provably intractable. 

Complexity results for unary operators would be of theoretical interest alone if one 
could not supply interesting problems in which unary operators are used. One interesting 
application in which this problem arises is the determination of dominance relationship 
between different outcomes in a CP-net (Boutilier, Brafman, Hoos, & Poole, 1999). This 
problem is reducible to STRIPS planning with unary operators. 

Another example, of greater interest to the planning community, is a planning-based 
reactive control system that commands the NASA Deep Space One autonomous space- 
craft (Pell, Bernard, Chien, Gat, Muscettola, Nayak, Wagner, & Williams, 1997; Williams 
Sz Nayak, 1996, 1997). This system was hailed by Weld (1999) in his recent survey of AI 
planning as one of the most exciting recent developments in the area of planning. Natu- 
rally, the complete system (Pell et al., 1997) is very complex, however, its configuration 
planning and execution subsystem are of particular interest to us. In the context of control- 
ling Deep-Space One, Williams and Nayak (1996, 1997) present a reactive planner. Burton, 
that generates a single control action for the main engine subsystem of the spacecraft, and 
compensates for anomalies at every step. Given a high-level goal (for example, thrust in 
one of the engines). Burton continually tries to transition the system toward a state that 
satisfies the desired goal. What is particularly relevant for us is that Burton's task can be 
described as a STRIPS planning problem in which each operator affects only a single vari- 
able (hardware component) - Williams and Nayak (1997) argue that in physical hardware 
it is usually the case that each state variable is commanded separately. However, Burton is 
based on two additional important restrictions: First, the planner is explicitly supplied with 
a serialization order for any satisfiable set of goal. Second, all operators must be reversible. 

One of the reasons cited for designing Burton as a reactive planner that generates a 
single action at a time was the potential intractability of generating whole plans. Indeed, 
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Williams and Nayak were pessimistic about the prospects of generating whole plans quickly 
even for Burton, i.e., for problem instances with serializablc sub-goals and single-effect 
operators. As our results show, this pessimism was not fully justified. 

Our work continues the study of planning with unary operators. This apparently easier 
problem is in fact as hard as the general STRIPS planning problem (Bylander, 1994). How- 
ever, we can obtain finer distinctions and some positive results if we pay closer attention to 
the causal structure of the domain. For example, it is easy to show that when the causal 
graph is a tree, it is easy to determine a serializability ordering over any set of sub-goals, 
and consequently, obtain a plan in polynomial time. In this paper we analyze the relation- 
ship between the domain's causal graph and the complexity of plan generation and plan 
existence. In particular we prove the following results: 

• When the causal graph forms a polytree (the induced undirected graph is acyclic), and 
its node indegree is bounded by a constant, then plan existence and plan generation 
are polynomial. 

• When the causal graph is directed-path singly connected (there is at most one directed 
path between any pair of nodes), then plan existence is NP-complete. 

• In general, plan generation for the problems with acyclic causal graphs is provably 
intractable, i.e., the problem requires exponential time. The corresponding claim 
is derived from a previous result by Jonsson and Backstrom (1998b). However, we 
show that the complexity of plan generation for these problems can be bounded by a 
function of the number of paths within the causal graph. 

Note that the complexity of the problems with polytree causal graphs but with unbounded 
node indegree remains an open problem - it is still to be shown whether they can be solved 
in polynomial time, or they are NP-complete. 

Finally, we relate our results to an old open question: how difficult is it to generate 
plans for problems with serializablc subgoals (Korf, 1987)? This question was stated by 
Bylander (1992), and different hypotheses were raised by different researchers. Here, we 
present a clear, though somewhat disappointing answer: First, our results suggest that even 
when the underlying causal graph of the problem is acyclic (and thus the problem is known 
to be serializablc), finding a serialization ordering on the problem subgoals may be hard. 
Second, wc show that even if the actual serialization ordering on the subgoals is known, 
solving the problem is not necessarily easy. 

The rest of this paper is organized as follows: In Section 2 we first introduce some 
basic formalism used in the paper, then discuss, motivate and illustrate the notion of causal 
graph. In Sections 3 and 4 we present our results on the relation between the form of 
the causal graph and the complexity of the planning problem. In Section 5 we discuss the 
sub-goal serializability issue and the impact of our results on it. In Section 6 we describe 
some related work on complexity of planning, and connect our work with the previous 
results. We summarize in Section 7. Finally, Appendix A provides a short review of the 
POP algorithm (Penberthy & Weld, 1992), and Appendix B provides some of the proofs. 
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2. Basic Formalism and Causal Graphs 

In this paper we consider only propositional planning problems, using the propositional 
STRIPS with negative goals formalism (Bylander, 1994), in which both positive and negative 
preconditions are allowed. Following Backstrom and Klein (1991b), we distinguish between 
preconditions and prevail conditions. In the former case the variable involved changes its 
value after the operator is executed, while in the latter case the value does not change. The 
post-condition of an operator expresses which state variables it changes and what values 
these variables will have after executing the operator. The pre-condition specifies which 
values these changed variables must have before the operator is executed. The prevail 
condition specifies which of the unchanged variables must have some specific value before 
execution of the operator and what these values are. Hence, prevail conditions, such as 
having a visa, are needed in order to apply an operator, such as Enter-USA, but their 
values do not change after the operator is applied. Finally, we assume that an operator is 
applicable if and only if both its pre- and prevail conditions are satisfied. 

Formally, we assume that a problem instance is given by a quadruple 11 = (V, A, Init, Goal), 
where: 

• V = {vi, . . . , w„} is a set of propositional state variables, each one with an associated 
binary domain T>{vi). The domain Viv-i) of the variable Vi induces an extended domain 
'D~^{vi) = ^{vi) U {u}, where u denotes the unspecified value. 

• Init is an initial, fully specified state, i.e. Init G V{vi) x . . . x V{vn)- 

• Goal is a set of possible goal states. We assume that such a set is specified by a partial 
assignment on V, thus Goal G X'+(vi) x . . . x P+(w„). 

• A = {^i,...,^jv} is a finite set of operators of the form (pre, post, prv), where 
pre, post, prv C V'^{vi) x ... x P+(u„) denote the pre-, post-, and prevail condition, 
respectively. In what follows, by pre(^), post(yl), and pry{A) we denote the corre- 
sponding conditions of an operator A, and by pre(A)[i], post(^)[i], and prv(^)[i] the 
corresponding values of the variable Vi. 

For every Vi G V, we must have either pre(^)[i] = u or prv(74)[i] = u. Further, 
post(^)[i] 7^ u if and only if pre(^)[i] / u, in which case post(^)[i] / pre(A)[i]. 

In this paper we analyse only planning problems with unary operators. Therefore, in 
what follows, we assume that, for each operator A G A, we have that: 

1. there exists a variable Vi G V, such that pre(A)[i] ^ u, and 

2. for each other variable vj eV — {vi}, pre{A)[j] = u. 

Note that specifying both pre- and postconditions in case of only propositional variables 
is redundant, and we use it only to simplify the presentation. Likewise, our assumption that 
post(74) 7^ u implies pre (A) 7^ u is different from the usual STRIPS formalism, and requires 
an exponential time translation in general. However, in our case of only unary operators, 
this translation takes only linear time. 
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2.1 Causal Graphs 

Causal graphs were used by Williams and Nayak (1997) as a tool for describing the structure 
of planning domains with unary operators. They represent a dependence relation between 
the state variables in the domain. A causal graph ^ is a directed graph whose nodes 
correspond to the state variables. An edge from p to q appears in the causal graph if 
and only if some operator that changes the value of q has a prevail condition involving 
some value of p. Hence the immediate predecessors of g in ^ are all those variables that 
affect our ability to change the value of q. Such a problem structure was introduced by 
Knoblock (1994) in the context of automatic generation of abstractions for planning. The 
causal graph is an intuitive model which is easily constructed given any planning problem. 

Causal graphs are not the only graphical structure that can be derived from a given 
planning problem, and effectively exploited in solving it. For instance, graphs in which 
operators and literals (and not variables/propositions) are represented by the nodes, and the 
edges represent both prevail and preconditions were introduced by Etzioni (1993) and Smith 
and Peot (1993). In particular, problem space graphs of Etzioni (1993) and operator graphs 
of Smith and Peot (1993) were proposed as mechanisms to reduce the number of threats 
that arise during the total-order and partial-order planning, respectively. However, in this 
paper we focus on the causal graphs, since they were shown to be especially informative 
when all operators are unary (Jonsson &; Backstrom, 1998b; Williams &; Nayak, 1997). 

Causal graphs have an important potential role in the design of autonomous industrial 
systems, as argued and demonstrated by Williams and Nayak (1997): Unary operators are 
natural when the manipulated objects are hardware components, since the basic control 
actions in such systems change the state of a single hardware component. The applicability 
of these control actions in any state depends on the state of the affected component as 
well as on the state of the related hardware components. This naturally gives rise to a 
planning domain with unary operators. Moreover, since the state variables correspond 
to hardware components, in the induced causal graph we typically see that the prevail 
dependencies between variables are usually implicitly entailed by the intcr-composition of 
the hardware components. Thus, the causal graph of such domains resembles the structure 
of and the relationships between the system's hardware components. This resemblance has 
important practical ramifications for system design given the relationship between causal 
graph structure and the complexity of plan generation: It enables the system designer to 
consider the effect of his hardware design on the system's ability to autonomously generate 
control sequences. 

A case in point is the planning problem studied by Williams and Nayak (1997), which 
had a number of important features: all operators were unary and reversible, and the causal 
graph was acyclic. Williams and Nayak argued that acyclic connectivity frequently occurs 
in designed systems. However, the requirement that all operators should be reversible seems 
to us restrictive, and it has important impact on the complexity of the problem. In the 
case of the Burton planner (Williams &; Nayak, 1997), there were good reasons to make this 
assumption. Burton's reactive nature precludes extensive deliberation on the consequences 
of its operators. Thus it leaves open the possibility that operators may degrade the system's 
capabilities, leading it to dead-ends. In that case, the restriction to reversible operators was 
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required in order to achieve a more reliable system. As we show later, in certain cases, 
complete plans can be generated efficiently even when the operators are not reversible. 

Williams and Nayak's work has another interesting aspect, as noted by Weld (1999). 
For a long time, researchers have known that planning problems with serializable subgoals 
are likely to be easier to solve. Williams and Nayak recognized that their spacecraft config- 
uration task was serializable (many real- world problems are not), and, more importantly, 
they developed a fast algorithm for computing the correct order based on the fact that 
the underlying causal graph is acyclic. However, their algorithm makes heavy use of the 
fact that all operators are reversible. Informally, reversibility implies that we can solve our 
subgoals one by one as long as they are consistent with some topological order of the causal 
graph without taking into account any global considerations: any side-effect can always 
be undone. Without the assumption of operator reversibility, it is relatively easy to show 
that Williams and Nayak's algorithm works only if the causal graph forms a directed chain. 
Even when the causal graph is a tree, although the problem is easy, one must take care in 
the choice of which subgoal to achieve next when operators are not reversible. As we show 
later, when the structure of the causal graph is more complicated than a directed tree then 
either the problem is hard or, if not, a more sophisticated algorithm is required. 

Finally, we note that the existence of reversible operators might make the problem seem 
easier than it actually is. In this paper we present an example of a propositional planning 
problem with unary operators, acyclic causal graph, and totally reversible operators, the 
minimal solution of which is exponentially long in the size of the problem's description. 

2.2 Example 

In order to illustrate the notion of a causal graph, consider the following example, inspired 
by the work of Williams and Nayak (1997) on controlling the main engine subsystem of the 
Cassini spacecraft, in general, and its valve driver circuitry, in particular. 

Each valve VL (on/off) is controlled by a valve driver VLD (open/close), and a safety 
control unit SCU (safe/unsafe). Each driver controls exactly one valve, while a safety 
control unit can control several valves. Commands to the driver are sent via a driver 
control unit, that consist of two switches, and S'^, which can be either on or off. The 
activating states of and are described below. A valve reacts (by a state change) 
to a command from its driver only if (i) the instruction actually involve a state change 
(i.e., an open valve should not be reopened), and (ii) the safety control unit indicates that 
manipulating the valve is safe. In addition, the valve can be closed if the safety control unit 
indicates an unsafe situation. For simplicity of presentation, Table 1 presents the operator 
set for controlling the valves and valve drivers only. The dashed boxes stand for driver 
control units, two switches in each. 

Now suppose that the valves VLi and VL2, with the drivers VLDi and VLD2, re- 
spectively, are controlled by a shared safety control unit SCU. Given the operator set in 
Table 1, the causal graph for controlling this subsystem is presented in Figure 1. 

3. Polytree Causal Graphs 

Starting at this section, we show how, by bounding the structural complexity of the causal 
graph, we can bound the complexity of plan generation. Recall that we use a propositional 



320 



Structure and Complexity in Planning with Unary Operators 



Affected component 


pre 


post 


prv 


VLD 


close 


open 


5' = 1 A 5^' = 


open 


close 


5' = A S'' = 1 


VL 


on 


off 


VLD = dose A SCU = safe 


off 


on 


VLD = open A SCU = safe 


on 


off 


SCU = unsafe 



Tabic 1: A subset of the operator set for the valve circuitry controller example. 




Figure 1: Causal graph for the example. 



language (binary variables) to describe the state of the world, and each operator is described 
by its prevail conditions, single precondition, and single effect (or post-condition). The 
precondition and the effect are two literals, one the negation of the other. 

A causal graph forms a polytree if there is a single path between every pair of nodes in 
the induced undirected graph^, i.e., the induced undirected graph is a tree. For example, the 
causal graph presented in Figure 1 forms a polytree. For this class of problems we present 
a planning algorithm which is polynomial if the indegree of all nodes in the causal graph is 
bounded by a constant. We argue that this assumption is reasonable if the prevail depen- 
dencies reflect the inter-composition of some controlled hardware components (Williams Sz 
Nayak, 1997). 

Given a propositional planning instance with a polytree causal graph, we: 

1. Provide a general upper bound for the number of times that a variable may be required 
to change its value on a valid, irreducible plan. 

2. Using this general upper bound, provide a polynomial time procedure, called DETERMINE- 
MAX-SEQUENCE, that, given a variable v, determines the actual maximal number of 
times that v can change its value on a valid, irreducible plan. 

3. Provide a preprocessing algorithm that: (a) determines whether or not a plan for a 
given problem instance of our class exists, and (b) performs a substantial amount of 



1. These graphs are also known as singly connected DAGs. 



321 



BRAFMAN & DOMSHLAK 



preprocessing for the subsequent step of plan generating. This algorithm is based on 
a top-down execution of DETERMINE-MAX-SEQUENCE on the variables of the given 
problem instance. 

4. If the answer of the plan existence check is positive we run a particular determin- 
istic instance of the POP algorithm^ (Penberthy & Weld, 1992), called pop-pcg, 
that generates the required plan using the information provided by the preprocessing 
algorithm, without backtracking, in linear time. 

Informally, this process is based on the following properties of the planning problems 
with polytree causal graph. First, the bound achieved in step 1 is necessary for the steps 2-3, 
which are the main steps of our technique. By itself, this bound will be valid not only for 
a polytree, but for a wider class of directed-path singly connected causal graphs. However, 
steps 2-3 will be valid for polytree causal graphs only, because of the following properties 
of this form of dependence relation between the variables: 

(i) Given a variable v € V, changing the value of a parent (immediate predecessor) 
w G pred(f) does not require any changes of neither other parents of v, nor their 
predecessors in the causal graph. 

(ii) The number of times that a variable v will be able to change its value along a valid 
plan for a given problem instance depends directly both on these numbers for pred {v) , 
and on the actual ordering of the value changes of pred(v). 

(iii) Prom (i) it follows that all the possible orderings of the value changes of pred (v) are 
legal. In addition, it will be shown that chosing an ordering for the value changes of 
pred(u) will not affect our ability to change the value of any variable except of v. 

(iv) The crucial part of the process (steps 2-3) is basically about finding the right ordering of 
the right number of value changes of pred(i;) for each variable € V. By synchronizing 
these changes to v^s parents appropriately, we can increase the number of possible 
changes to v. 

We start with some notation. First, a valid plan P for a given planning instance 11 
will be called irreducible if any subplan P' of P is not a plan for 11, in the following sense: 
Removal of any subset of (not necessarily subsequent) actions from P makes the resulting 
plan either illegal, or its initial state is not Init, or its goal state is not one of the states 
specified by Goal. The notion of irreducible plans was introduced by Kambhampati (1995), 
where it was exploited for admissible pruning of partial plans during search^. 

2. A short review of the POP algorithm, and the corresponding formalism is provided in Appendix A. For 
those familiar with the algorithm, we note one slight technical change, stemming from the use of unary 
operators. POP uses two fictitious actions Ao and Aoo to capture the initial and goal state, respectively. 
Here, wc replace each of these actions by a set of actions, each with a single effect. The (fictitious) action 
setting the initial value of variable Vi is denoted A° and the fictitious action whose precondition is the 
goal value of variable Vi is denoted A*. 

3. Irreducible plans were called in (Kambhampati, 1995) minimal plans. However, we decided to change the 
name of this concept in order to prevent an ambiguity between "minimal as irreducible" and "minimal 
as optimal" . 



322 



Structure and Complexity in Planning with Unary Operators 



Now, given a planning instance H, let V be the set of all irreducible plans for 11. We 
denote by MaxReq(t') the maximal number of times that a variable t; € V changes its value 
in the course of execution of an irreducible plan for 11. Formally, let Req(P, v) be the number 
of times that v changes its value in the course of execution of a plan P. Then, 



Observe that, for any planning problem with unary operators, a variable must change 

its value at most once for each required change of its immediate successors in the causal 
graph (in order to satisfy the necessary prevail conditions), and then at most once in order 
to obtain the value requested by the goal. Thus, for all variables in V, MaxReq(t') satisfies: 



where succ(t;) denotes the immediate successors of v in the corresponding causal graph. 
Adopting the terminology from (Domshlak & Shimony, 2003; Shimony &; Domshlak, 2002), 
a directed acyclic graph G is directed-path singly connected if, for every pair of nodes s,t G G, 
there is at most one directed path from s to t. The following lemma shows that if the causal 
graph forms a dircctcd-path singly connected DAG then we can bound MaxReq(i;) by n. 
Clearly, all polytrees are directed-path singly connected DAGs, but not vice versa. 

Lemma 1 For any solvable problem instance H with a directed-path singly connected causal 
graph over n variables, for any variable v, we have that MaxReq(i;) < n. 

Proof: The proof is by induction on n. For n = 1 it is obvious that MaxReq(w) < 1. Now 
suppose that when |V| = ra — 1 then for any v &V, 



Let n' be some problem instance for which |V'| = n. Suppose that the variables in V' = 
{vi, . . .Vn} are topologically ordered based on the domain's causal graph. Clearly, f„ is 
a leaf node (i.e., succ (u„) = 0). We will denote by 11 the problem instance obtained by 
removing from the domain, and the corresponding variable set by V. According to Eq. 1, 
for each immediate predecessor v of Vn in the causal graph. 



where newMaxReq(t;) denotes MaxReq(f) with respect to IT'. Generally, since the causal 
graph is directed-path singly connected, for each variable v gV' , 



MaxReq(u) = max{Req(P, f)} 




(1) 



succ{v) 



MaxReq(t;) < n — 1 



newMaxReq(w) < MaxReq(f) + newMaxReq(un) < MaxReq(f) + 1 




if there is a path from v to v, 
otherwise 



'n 



(2) 



and thus, for each v eV', holds 



newMaxReq(?;) < n 



□ 
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Recall that MaxReq(t;) stands for an upper bound on the number of value changes of v 
that may be required by a valid, irreducible plan. However, the maximal achievable number 
of value changes of v, denoted by l\/laxPoss(i') can be greater or less than MaxReq(f ). For 
example, if v has no predecessors in the causal graph, and there are two operators affecting 
V differently, then MaxPoss('i;) = oo. 

We denote the upper bound on the feasible number of value changes of v that may 
be required in a valid, irreducible plan for H by FMaxReq(?;). Informally, no more than 
MaxPoss(u) value changes of v can be required and no more than MaxReq(u) value changes 
of V should be required, thus 

FMaxReq(t') = min(MaxPoss(i;), MaxReq(t')) (3) 

Determining FI\/laxReq(f ) for all variables requires explicit examination of a given prob- 
lem instance. Recall that here we restrict the causal graph of 11 to form a polytree. To 
simplify the presentation, we assume that the goal values are specified for all state variables, 
i.e. Goal G 'D{vi) x . . . x I>(f„). Later we show that this assumption does not affect the 
generality of the algorithm. Denote by and v* the initial and the goal values of v in 11, 
and by A„ C A the set of all operators aff'ecting v. First we examine the root variables of 
the causal graph, then we analyze the rest of the variables. 

Denote by pred(f) the immediate predecessors of v in the causal graph. If pred(?;) = 0, 
then there are at most two operators A~,A'^ in A„: A'^ has v* as its postcondition, while 
A~ has the reverse effect. Since these operators have no prevail condition, if both A~ and 
A'^ are presented in A, then they can be applied one after another an infinite number of 
times. Therefore, from Eq. 3, FMaxReq(T;) = n. If A^ ^ {A~,A^} then we have two cases: 
If the initial and the goal values of v are the same, then we cannot change the value of v 
and reconstruct it later, and thus FMaxReq(t;) = 0. Alternatively, if the initial and the goal 
values of v are different then if A^ = {A^} then we can achieve the goal value of v but only 
once and thus FMaxReq(v) = 1. Otherwise, the goal value of v is unachievable, thus the 
given problem instance is unsolvable. Table 2 summarize this analysis. 





Ar 


FMaxReq((') 


v^ = v* 


{A^ , A^} 


n 




otherwise 







{A^ , A^} 


n 




{At} 


1 




otherwise 


no solution 



Table 2: FMaxReq('u) values for the root variables in the causal graph. 

Now consider a variable v which is presented by an internal node in the causal graph: 
pred (v) = {wi , . . . ,Wh} ^ $■ Observe that the number of possible value changes of v depends 
on and only on: 

1. The initial and the goal values of v, i.e., and v*. 
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2. The set of operators affecting v, i.e., A^. 

3. The maximally possible (but still reasonable) number of times that predecessors of v 
can change their values, i.e., FMaxReq('u;i), . . . , FMaxReq(u;fe). 

4. The actual scheduling of the value changes of the predecessors of v. 

The last point is crucial - it means that in order to determine FMaxReq(f;) we should find 

a particular scheduling of the vahic changes of pred(t;) that allows such a maximal number 
of value changes for v. The corresponding interleaving sequence of v^s values, starting and 
finishing by 1;° and v* respectively, with FMaxReq('u) value changes will be called maximal 
and will be denoted by a{v) (|cr(v)| = FMaxReq('i;) + 1). 

Prom Lemma 1, for 1 < i < k, we have FMaxReq(wj) < n, thus the number of different 
orderings of value changes of pred(f) can be exponential in n. For instance, when, for 
1 < i < k, we have FI\/laxReq(wi) = n, this number of different orderings can be expressed 
as: 

where the correctness of the expression on the left side of the inequality is shown by Lemma 4 
(see Appendix B, p. 347). Clearly, we cannot check all these orderings in a naive manner. 
Following, we provide an algorithm that determines (t{v) in time which is polynomial in n. 

For clarity of presentation we want to distinguish between the different elements of a 
maximal sequence (^{v). Since all variables are binary, we denote the initial value of v, v^, 
by and the opposite value by (black/white). Similarly, bj and Wj will stand for the 
corresponding values of the variable Vi. If so, we can think about all the operators in A 
as described in this language. Likewise, we sequentially number the appearances of each 
value of V on cr{v). For example, b* stands for the ith appearance of the value b^ along 
o-{v). To illustrate this notation, suppose that ^{v) = {true, false}, the initial value of v 
is = true, and FMaxReq(t') = 4. Then, we have: 

b^ = true 
= false 
a{v) = bi • • b^ • • b^ 

First, for every variable v, every operator A G A^, is extended to a set of operators that 
explicitly specify prevail values for all parents of v in the causal graph: If |pred(t;)| = k, 
and the prevail condition of A is specified only in terms of some < k' < k parents^ of 
V, then A is extended to a set of 2*'"'^ operators, where each operator extends A by an 
instantiation of the previously unspecified parents of v. For example, consider a variable v 
with pred(t;) = {u,w}, and an operator 

A = {pre : {b^}, post : {w„}, prv : {b„}}, 

4. For every other parent Wj of v, we have prv(>l)[ji] = u. 
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the prevail condition of which does not involve w. This operator is extended to a pair of 
operators: 

A' = {pre : {b^}, post : {w„}, prv : {b„, b^}} 
A" = {pre : {b^}, post : {w„}, prv : {b„,Wi„}} 

corresponding to the possible values of w. In what follows, we refer to the operator set 
resulting from such a compilation of A as A'^. Note that, under the assumption of constantly 
bounded maximal indegree k of the causal graph, compiling A into A'^ takes only polynomial 
time, since, for every variable v, |A^j < 2'^^^, and thus |A^| = 0{n2'^^^). 

Given the maximal sequences a{wi), . . . , a{wk) and the operator set A^ we construct 
a directed graph (denoted as G'g{v)) that captures all (and only) feasible sequences of, up 
to n, value changes of v, where each value change is annotated with the corresponding 
assignment on pred(u). Although the number of the captured sequences can be exponential 
in n, the size of ) is polynomial in n. With respect to this graph, the problem of finding 
the maximal sequence a{v) is reduced to the problem of finding a longest path from a given 
node to an arbitrary other node in a directed acyclic graph. 

The graph G'^{v) is created in three incremental steps. At the first step, given the 
maximal sequences a{wi), . . . , a{wk) and the operator set A^ we construct a directed labeled 
graph G{v) capturing information about all sequences of assignments on pred{v) that can 
enable n or less value flips of v. The graph G{v) is defined as follows: 

1. G{v) consist of ij nodes, where 

{n, ((n = 2j) and {v^ = v*)) or 
((n = 2j + 1) and / v*)), j G N 
n — 1, otherwise 

2. G{v) forms a 2-colored multichain, i.e., (i) the nodes of the graph are colored by black 
and white, starting by black; (ii) there are no two subsequent nodes with the same 
color; (iii) for 1 < z < — 1, edges from the node i are only to the node i + 1. 

Observe that such a construction of G{v) promises that the color of the last node will 
be consistent with v* . 

3. The nodes of G{v) are denoted precisely by the elements of the maximal sequence 
cr(v), i.e., b* stands for the zth black node in G{v). 

4. Suppose that there are m operators in A^ that change the value of v from b„ to w„. In 
this case, for each i, there arc m edges from b* to w* , and |A^| — m edges from w* to 
b^"*"^. All edges are labeled by the prevail conditions of the corresponding operators, 
i.e., a /c-tuple of the values of wi, . . . ,Wk- This tuple is denoted by l{e) (label of the 
edge e) and its component, corresponding to a predecessor Wi, is denoted by ^(e)^-. 

This formal definition of G{v) is relatively complicated, thus we provide a demonstrating 
example: Suppose that we are given a problem instance over 5 variables, and we consider 
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a variable v with pred(v) = {u,w}, = b^, and v* = Wy. Recall that every operator in 
A'^ is presented as a three-tuple (pre, post, prv) of pre-, post-, and prevail conditions of the 
operator respectively. Suppose that: 

a{u) = • a{w) = b^ • w^, • b^ • 

{Al = {pre : {b„}, post : {w^}, prv : {b„,w^}} 
Al = {pre : {w„}, post : {b^}, prv : {b„, b^}} 
Al = {pre : {w„}, post : {b^,,}, prv : {w„,w^}} 

For this case, the graph G{v) is presented by Figure 2. 



Figure 2: Example of the graph G{v). 

The constructed graph G{v) captures information about all potentially possible execu- 
tions of the operators in that can provide us MaxReq(f) or less value changes of v. 
Each path, started at the source node of G{v), uniquely corresponds to such an execution. 
Although the number of these alternative executions may be exponential in n, this graph- 
ical representation is compact: the number of edges in G{v) is 0{n ■ |A^|). Note that the 
information about the number of times that each operator in A^ can be executed is not 
captured by G{v). The following two steps add this information indirectly and exploit it to 
find a maximal sequence (t{v). 

At the second step of construction, wc expand G{v) with respect to the maximal se- 
quences (t{wi), . . . , cr{wk) as follows: Each edge e G G{v) (which by definition corresponds 
to some operator A € A^), is replaced by a set of edges such that their labels correspond to 
all possible assignments of the elements of a{w\), ... , a{wk) to l{e) (i.e., prv(Y4)). Likewise, 
we add a dummy source node Sy, with an edge from s^ to the original source node of G{v) 
labeled by a tuple of the first elements of (t(?y;i), . . . , (T{wk) (= initial values of tt;i, . . . , w^). 
Similarly, we add a dummy target node t^, with an edge from the original target node of 
G{v) to ty labeled by a tuple of the last elements of a{wi), . . . ,a{wk) (= goal values of 
wi, . . . ,Wk). We denote this extended graph by G'{v), and Figure 3 illustrates G'{v) for the 
example above. 

The extended graph G'(v) can be viewed as a projection of the maximal sequences a(wi), 
1 < i < k, on the graph G{v). Each edge in G{v) may be replaced by 0{n^) edges in G'{v), 
and thus the number of edges in G'{v) is 0{n^~^^ ■ |A^|). 

It is easy to see that not all paths in G'{v) starting at s^ are relevant. For example, in 
G'{v) above, an operator instance prevailed by b^b^ can not be performed after an operator 
instance prevailed by b^w^. Thus, now we are faced with the problem of finding a longest 
feasible path from s„ to a node in G'{v), the label of which is consistent with v*. The 
following (last) step provides a reduction of the problem of finding a longest feasible path 
from Sy to a w*-colored node in G'{v) to a known problem of finding a longest path in a 
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Figure 3: Example of the graph G'{v). 



directed acychc graph. Let the graph G'^{v) have the edges of G'{v) as nodes, and let its 
edges be defined by all allowed pairs of immediately subsequent edges in G'{v): (e,e') is 
allowed if, for 1 < i < fc, either l{e)u)i = K^')wi or ^(e')«,. appears after l{e)yj^ on a{wi). Such 
a construction is a variant of a so called "edge graph" known in graph theory; the addition 
in our case is the exclusion of non-allowed edges from it. Clearly, G'^{v) can be constructed 
in time polynomial in size of G'{v), and the number of edges in G'^{v) is 0{n^^^'^ ■ |A^p). 




Figure 4: Example of the graph G'f,{y). 



Figure 4 presents G'^{v) for our example. The dashed edges present the longest path 
from the dummy source node to a node that corresponds to a value change from ^v* to v* 
(from b„ to w„). Such a longest path in G'^{v) describes a maximal sequence of value changes 
(t{v), and its length is actually FMaxReq(f ) + 1. In our example, a{v) = • • • w^, and 
FMaxReq(u) = 3. Note that if = v* then the empty path will be also acceptable since, 
in general, v does not have to change its value. In this case FMaxReq(f ) = and cr{v) will 
consist of only one element which corresponds to the initial (= goal) value of v. 

Observe that a longest path in G'^{v) describes not only a{y) but also the actual sequence 
of invocations of the operators from that provides (j{v). We denote by {A(bi)} and 
{A(wi)} the sequences of operator instances that have as effects the corresponding elements 
from the sequences {b^} and {w:?,} ({b:^} U {w:?,} = (y{v)) of u's values, respectively. In 
what follows, we address these sequences of operator instances as one sequence of operator 
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Procedure forward-check (11) 

1. Topologically sort all variables V based on the the causal graph. 

2. For each variable u € V, call DETERMiNE-MAX-SEQUENCE(n, -u), respecting the above 
ordering. 

3. If one of the calls to determine-max-sequence return failure, then return failure. 
Otherwise return success. 

Procedure determine-max-sequence (E, v) 

1. If pred(u) = then 

(a) If v° 7^ V* and ^ A^, return failure. 

(b) Otherwise, determine a{v) according to the rules in Table 2, and return success. 

2. Otherwise, if pred(t;) = {wi, . . . ,Wk} then 

(a) Construct G{v) (based on v^,v*, and A^). 

(b) Construct G'{v) (from G{v), based on a{wi), . . . a{wk))- 

(c) Construct G'^{v) (from G'{v), based on a{wi), . . . a{wk))- 

(d) Determine the longest path in G'g{v) to a node corresponding to a f *-ended value 
change, and derive a{v) and the corresponding sequence of operators from it. 

(e) If ^ V* and FMaxReq(v) = 0, return failure. Otherwise, return success. 



Figure 5: FORWARD-CHECK algorithm 



instances = {^Aiy'-^yf^^^^^^^^ ^ where Aiy]^ has v\ as its effect, and 

A; G N 



' i+i 
W , i = 2k + l 



Wy , i = 2k 

Procedure FORWARD-CHECK in Figure 5 summarizes the presented approach. Note that 
finding a set of longest paths from a node to all other nodes in a directed acyclic graph 
can be done in time linear in the size of the graph (Wiest & Levy, 1969). Therefore, the 
time complexity of a call to the determine-max-sequence procedure with a variable v is 
bounded by the size of the constructed graph Gg(w) and thus is 0{n'^'''^'^ ■ |A^p). FORWARD- 
CHECK calls determine-max-sequence n times. Therefore, if the maximal node indegree 
is bounded by a constant k, then the overall complexity of the algorithm is 0{\V\'^'^~^^ ■2^'^'^'^) , 
i.e., polynomial in the size of the problem description. 

Theorem 1 A given problem instance with a polytree causal graph is solvable if and only 
if, for each v £V, FORWARD-CHECK succeeds in constructing the maximal sequence cr{v). 

FORWARD-CHECK fails if and only if at least one of the calls to the determine-max- 
SEQUENCE procedure fails. In turn, a cah to determine-max-sequence on a variable v 
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Algorithm: POP-PCG ((.A, O, £), agenda, A) 

1. Termination: If agenda is empty, return {A,0,C) 

2. Goal selection: Let {di,Aneed) be a rightmost pair on the agenda (by definition, 
Aneed ^ and i?j is one of the pre/prevail conditions of ^„eed)- 

3. Operator selection: 

(a) If Aneed + A* = uf) then Aadd = A{i4) G U {A^,}. 

(b) Otherwise: 

i. Let m = max{ j \ A{vl) £ A}. 

ii. If V* is consistent with z^™ (both associated with the same color from {b, w}) 
then A^ad = A{v^), else A^m = A{u'^+^). 

4. Plan updating: Let £ = £ U {Aadd ^ ^need}, and let O = U {Aadd < A^eed}- If 
Aadd is newly instantiated, then ^ = ^ U {Aadd} and O = OU < Aadd < A*} 
(otherwise A and O remain unchanged). 

5. Update goal set: Let agenda = agenda - ^need)}- If Aadd is newly instantiated, 
then for each of its pre/prevail conditions Q, add {Q,Aadd) to agenda. 

6. Threat prevention: If Aadd = A{i'i), j > 1, then, for each A £ A, s.t. uf~^ belongs 
to the prevail conditions of A, add {^4 < A{uj)} to O. 

7. Recursive invocation: POP-PCG((yl, O, £), agenda. A), where agenda is topologi- 
cally ordered (based on the causal graph with respect to the precondition part of each 
pair). 

Figure 6: POP-PCG algorithm 

fails if and only if the initial and the goal values of v are different but there is no way to 
change the value of v even once. Thus, if FORWARD-CHECK fails, then no plan exists. 

To prove the opposite direction we proceed as follows: We define the pop-pcg algorithm 
(POP for polytrce causal graphs) and show that it will succeed without backtracking if 
FORWARD-CHECK succeeds.^ POP-PCG is described in detail in Figure 6, and it works as 
follows: First, let us expand each sequence of operator instances Fj by A(uj) (A{bl)) which 
will stand for the dummy operator A^. (Recall that up until now, only operators of the 
form A{i'j) for j > 1 were defined.) The algorithm maintains a goal agenda sorted based 
on the causal graph structure: parent variables appear after their descendents. At each 
point, the next agenda item is selected; if it requires achieving some value for Uj we add the 
corresponding operator to the plan with the desired effect (step 3a). Actually, if we would 
be ready to accept plans with possible redundant steps, we can omit the next step 3b from 
the algorithm by assuming that the goal value of each variable v is the last element of the 

5. For a short review of the POP algorithm, the corresponding formalism, and the description of the initial 
call to the algorithm, we refer the reader to Appendix A. 
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maximal sequence (7{v). However, if wc would like our plan to be irreducible, then a careful 
decision about the really required number of value changes of each variable is required. 
This decision is captured in step 3b by analysis of the value changes of a variable Vi that 
were found necessary in the previous iterations of the algorithm in order to satisfy the 
predecessors of Vi in the causal graph. Note that the agenda is sorted with respect to some 
reverse topological ordering of the causal graph, thus if an operator affecting Vi was selected 
from the agenda then no operator affecting some predecessor of Vi in the causal graph will 
appear on the agenda until the end of the algorithm. No threats arise in POP-PCG, and the 
ordering constraints are consistent. 

Lemma 2 // FORWARD-CHECK was successful then POP-PCG will return a valid plan. 
Proof: The lemma will follow from the following claims: 

1. For every agenda item, there exists an operator that has it as an effect. 

2. There are no threats in the output of POP-PCG. 

3. The ordering constraints in O are consistent. 

4. The agenda will be empty after a polynomial number of steps. 
For the proof see Appendix B, p. 343. □ 

Recall that, for simplicity of presentation, we assumed that the goal values are specified 
for all state variables (single goal state), i.e. Goal G X'(t'i) x . . . x 'D{vn)- Now we show that 
the presented approach, with minor modifications, works for a set of possible goal states as 
well, if such a set is specified by a partial assignment on V, i.e. Goal G 'D~^{vi)x . . . xV^{yn)- 
Note that the latter assumption is widely accepted in the planning literature. 

First, no modifications should be done in processing variables that are specified by Goal. 
Now, for each variable v, such that v* is not specified by Goal, the modifications are as 
follows: 

1. The graph G{v) will consist of exactly n nodes. This is correct since (i) according to 
Lemma 1, n changes of v have to be sufficient, and (ii) any value change of v can be 
its last value change. 

2. No changes in construction of G'{v) and G'^{v). 

3. In the procedure determine-max-sequence: 

(a) In step 2d, determine the longest path from the dummy source node to any other 
node in the graph. 

(b) In step 2e, always return success. 

Again, this is correct since any value change of v can be its last value change, and, in 
particular, v may remain unchanged in a plan for a given problem. 

Finally, the POP-PCG algorithm starts with a null plan that contains the end operator 
A* only if v* is specified by Goal. 
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4. Directed-Path Singly Connected and General DAGs 

In this section we analyze planning complexity in face of more complicated causal graphs. 
First, we show that when the causal graph is directed-path singly connected even plan 
existence is NP-complete. Second, we show that for general causal graphs the situation is 
even worse. Finally, we characterize an important parameter of the causal graph affecting 
planning complexity, which allows us to extend the class of problems which are in NP. 

Theorem 2 Plan existence for strips planning problems with unary operators and directed- 
path singly connected causal graph is NP-complete. 

Proof: For the proof see Appendix B, p. 346. □ 

Note that node indegree in the causal graph of the problem created in the proof of 
Theorem 2 is bounded by 6. The hardness for directed-path singly connected causal graphs 
with maximal indegree lower than 6 is thus open. 

The directed-path singly connected structure of the causal graph turns out to be crucial 
for guaranteeing reasonable solution times. As we now show, there are solvable proposi- 
tional planning problems with an arbitrary acyclic (DAG) causal graph that have minimal 
solutions of exponential size. Analysis of this class of problems points to the reason for such 
provable intractability. This allows us to characterize an important parameter of the causal 
graph affecting planning complexity and to extend the class of problems which are in NP. 
However, all these restricted problems are still NP-complete. 

Theorem 3 Plan generation for general STRIPS planning problems with unary operators 
and acyclic causal graph is provably intractable, i.e. it is harder than NP. 

This theorem follows from Theorem 5.4 in (Jonsson Sz Backstrom, 1998b), that shows 
that plan generation for the 3S problem class is provably intractable. The point is that the 
upper bound for MinPlanSize, presented in Eq. 5, can be exponential in the size of the input 
in this case. First, we show by example that this upper bound can be achieved, then we 
present some analysis of the reasons for this intractability. 

The following example shows that an exponential upper bound can be achieved. It was 
used in the proof of Theorem 5.4 in (Jonsson & Backstrom, 1998b), and was originally 
presented in a different context by Backstrom and Nebel (1995). Consider a propositional 
planning problem with |V| = n, where, for 1 < z < n, V{vi) = {0,1} and pred(ui) = 
{vi, . . . Vi-i}. The operator set A consist of 2n operators {Ai, A'^, . . . where 






1 



u 



if j < i - 1 
if J = i - 1 
otherwise 
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It is easy to see that the causal graph of this problem forms a DAG (see Figure 7), and 
an instance of this planning problem with the initial state (0, . . . , 0) and the goal state 
(0, . . . , 0, 1) has a unique minimal solution of length 2*^ — 1 corresponding to a Hamilton 
path in the state space. 




Figure 7: Causal graph for the proof of Theorem 3 

Now we show that this escalation in complexity can be "parametrized" by the form of 
the causal graph. 

Lemma 3 For any solvable problem instance H with an acyclic causal graph over n vari- 
ables, for any variable v, we have that: 

n 

MaxReq(fi) < 1 + ^ pivi,Vj) 
j=i+i 

where p{vi,Vj) denotes the total number of different, not necessary disjoint, paths from Vi 
to Vj, where variables are ordered via a topological sort of the causal graph. 

Proof: The proof is by induction on i. For i = n it is obvious that MaxReq(f„) < 1. 

Now we assume that the lemma holds for any i > k, and prove it for i = k. Without loss 
of generality, assume that succ(i;fe) ^ 0. Otherwise, we simply have that MaxReq(?;fe) < 1. 
The proof is straightforward: 

MaxReq('Ufe) < 1 + 2^ MaxReq('yjJ < 

Vii^esucc{vk) 

I H " 
< l + |sUCc(Ufe)|+ J2 5Z P('"ik^'"j) = 

Vif,esucc{vk) j=ifc+l 

n 

j=k+l 

□ 

Lemma 3 entails that the upper bound for MinPlanSize(n) for a general planning problem 
with unary operators and acyclic causal graph depends on the number of different paths 
between the nodes in the causal graph. An immediate conclusion is that there is a significant 
class of problems with an acyclic causal graph for which planning is in NP. Let a DAG be 
called max-S- connected if the number of different directed paths between every two nodes 
in this graph is bounded by 6. 
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Theorem 4 Plan generation for STRIPS planning problems with unary operators and max- 
S-connected causal graph is NP-complete if S is polynomially bounded. 

Proof: Membership in NP is straightforward: If the variables of a given problem 11 are 

considered in a topological ordering induced by the causal graph, then from Lemma 3 follows 
that, for any variable Vi, MaxReq(t;j) < Sn. In turn, from this follows that MinPlanSize(n) < 
dn^, and thus, if 6 is polynomially bounded, then we can guess a minimal plan for 11 that 
could be verified in polynomial time. 

The hardness follows from Theorem 2 that shows that even if the causal graph is max-1- 
connected (directed-path singly connected), then plan existence (and thus plan generation) 
is hard. □ 



5. Serializable Subgoals 

A set of subgoals is defined to be serializable (Korf, 1987) if there exists an ordering among 
the subgoals such that the subgoals can always be solved sequentially without ever violating 
a previously solved subgoal in the order. Naturally, not all collections of subgoals are 
serializable - sometimes it may be necessary to interleave plans for achieving different goals. 
However, when a problem instance is serially decomposable, it is possible to design a set of 
macro-operators with respect to which the subgoals are serializable (Korf, 1985). 

A problem instance is serially decomposable if there exists some ordering of the state 
variables for which the cfi"ect of each operator on each state variable depends only on that 
state variable and previous state variables in the ordering. Unfortunately, Bylander (1992) 
shows that determining serial decomposability of a problem is PSPACE-complete. 

One major open problem put forth by Bylander in this context is: // a problem is 
known to be serially decomposable, how difficult is it to determine whether a given instance 
is solvable? As far as we know, the only work in this direction was done by Chalasani 
et al. (1991), where the serial decomposability of the "general permutation problem" was 
considered. In particular, they showed that this problem is in NP, but it is unknown 
whether it is NP-hard. Recently, some complementary results for Bylander's question were 
presented Koehler and Hoffmann (2000). Our results shed more light on this question: 
Any problem instance based on a unary operator domain whose causal graph is acyclic is 
serially decomposable. Therefore, it can be concluded that finding a solution for serially 
decomposable problems may require exponential time (i.e., the problem is in EXPTIMe). 
However, Bylander's question is about plan existence. In that case, Theorem 3 does not 
apply, and we can only apply our NP-hardness result (for directed-path singly-connected 
graphs), since it addresses plan existence as well. 

Weld (1999) hypothesized that: (1) If the underlying causal graph of the planning 
problem is acyclic, then a serialization ordering on the subgoals of the problem is obvious; 
(2) Serialized subgoals could be solved extremely quickly because no backtracking is required 
between them. Although the first observation sounds intuitive, our results suggest that it 
is rarely true. The acyclicity of the causal graph implies serializability, but in most of the 
cases its structure does not provide us sufficient information about the actual serialization 
ordering. Even when the causal graph is a directed tree one must think first before choosing 



334 



Structure and Complexity in Planning with Unary Operators 



an ordering. Likewise, our results imply that when the causal graph does not form an 
undirected tree determining a subgoal ordering is NP-complete, and if the causal graph is 
not directed-path singly connected, the problem is even more complex. 

The second observation is not always true either. The problem is that it is important to 
determine not only the serialization ordering over the subgoals, but also the exact strategies 
for achieving them. As wc showed, in certain cases, a problem with n serializable subgoals 
requires an exponentially long solution. When the domain variables are not binary, the 
situation is even worse - some of the corresponding complexity results can be derived from 
the computational analysis of Domshlak and Dinitz (2001). 

6. Connection with Related Work on Planning Complexity 

The idea of analyzing and exploiting structural properties is not new to classical planning, 
and in the last few years a number of important results have emerged. Generating plans 
in the context of the STRIPS representation language was shown by Bylander (1994) to 
be PSPACE-complete. Despite this fact, the existence of many successful planning systems, 
especially in recent years, demonstrates that planning is possible and practical for a wide list 
of domains. Bylander argues that the large gap between the theoretical hardness of planning 
and its practical success stems from the use of domain-dependent problem analysis and 
algorithms. Consequently, various authors have explored the existence of some constrained 
problem classes for which planning is easier. 

In this section we shortly overview some of the major, previous results on complexity 
of planning, and discuss their relationship to the results presented in this paper. For a 
more detailed presentation of the previous results discussed below we refer the reader to 
the original papers. 

6.1 Local Syntactical Restrictions 

In his seminal paper, Bylander (1994) presents a number of complexity results for preposi- 
tional planning, analyzing different planning problems based on the type of formulas used, 
the number and type (positive/negative) of operator pre- and postconditions, etc. The work 
of Bylander is extended by some interesting, complementary results by Erol at al. (1995). 
For example, Bylander shows that propositional planning in domains where each operator 
is restricted to have positive preconditions and one postcondition only is tractable. Gen- 
erally, extremely severe restrictions on operators are required to guarantee tractability, or 
even membership in NP. Note that Bylander (1994) and Erol et al. (1995) focuses on local 
syntactical properties of operators, i.e., properties of single operators. 

The only syntactic restriction that we pose on the planning problems in this paper is 
the unarity of the operators. Determining plan existence for this, apparently easier class 
of problems was shown by Bylander to be as hard as general propositional planning, i.e. 
PSPACE-complete. Note that this result by itself does not entail our Theorem 3, since 
planning problems with unary operators may induce causal graphs with cycles. Therefore, 
none of our results is entailed by the results presented by Bylander (1994) and Erol et 
al. (1995). 
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6.2 Global Syntactical Restrictions 

Backstrom and Klein (1991a, 1991b), and, subsequently, Backstrom and Nebel (1995), 
consider other types of restrictions, but using a more refined model (the SAS formalism) in 
which: 

1. The state variables are multi- valued, and 

2. Two types of preconditions are considered: prevail conditions, which are variable 
values that are required prior to the execution of the operator and are not affected by 
the operator, and preconditions, which arc affected by the operator. 

In general, four different restrictions were considered in these works: 

(P) Post-uniqueness: For each effect there is at most one operator that achieves this effect. 
In other words, desired effects determine operators to be used in a plan. Formally, a 
problem instance is post- unique if and only if, for each Vi eV and x eV{vi), there is 
at most one operator A e A such that post(A)[i] = x. 

(S) Single-valuedness: At most one value of each state variable appears in the prevail 
conditions of the operators. For instance, if a certain operator requires the light 
to be on (as a prevail condition), no other operator can use the prevail condition 
that the light is off. Formally, a problem instance is single-valued iff there exist no 
two operators A,A'gA and Vi E V such that prv(^)[i] ^ u, prv(A')[i] 7^ u, and 
prv(A)[z]^prv(^')M- 

(U) Unariness: Each operator affects only one state variable. 

(B) Binariness: All state variables have exactly two possible values, i.e. all state variables 
are propositional. 

All these four properties are syntactical. However, the properties P and S differ from the 
properties U and B by the fact that they have a global nature: Post-uniqueness and single- 
valuedness restrict not the form of the operators, but a global property of the whole set of 
operators. Backstrom and Nebel (1995) showed that US (unariness and single-valuedness) 
is the extreme problem class for which plan generation is polynomial. ^ 

The problems that we analyzed in this paper belong to the problem class UB, by defini- 
tion. As we already mentioned, even determining plan existence for this class of problems 
is PSPACE-complete. Now consider the problem class PUB. Backstrom and Nebel (1995) 
showed that: (i) PUB has instances with exponentially long minimal solutions, thus plan 
generation for PUB is requires exponential time; (ii) existence of bounded length plans for 
PUB is strongly NP-hard; and (iii) the complexity of general plan existence for PUB is still 
an open question. Informally it means that strengthening restrictions from UB to PUB 
does not reduce the complexity significantly, at least from the practical point of view. 

Proposition 1 Every UB problem, instance with a tree causal graph is either post-unique, 
or can be transformed into an equivalent post-unique problem instance in (low) polynomial 
time. Thus, TreeUB C PUB. 

6. For a thorough analysis of the complexity of SAS planning, we refer to Backstrom and Nebel (1995). 
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Proof: Consider a UB problem with a tree causal graph, and suppose that it is not post- 
unique. It means that there exist a variable f G V, with I'(v) = {v',v"}, such that there 
exist two operators ^1,^2 £ that change the value of v from v' to v", and prv(74i) 7^ 
prv(A2). 

Prom the assumption that the causal graph forms a tree it follows that |pred('i;)| < 1. 
If pred{v) = 0, then it is easy to see that the existence of such a pair of operators is 
simply impossible. Therefore, let pred(?;) = {w}, where T)(w) = {w',w"}. Without loss of 
generality assume that prv(y4i)[i(;] = {w'}, pr\/{A2)[w] = {w"}. Otherwise, if, for instance, 
prv(^i)[u;] = u, then it is easy to see that A2 is a redundant operator. 

Observe that in this case, prevail dependence of v on w is redundant: We can replace 
the pair of operators Ai,A2 in A by a single operator A that changes the value of v from 
v' to v" without any prevail condition. The replacement of Ai ,A2 hy A brings us to an 
equivalent problem instance in which the operator set A^, is post-unique. This way we 
continue to process iteratively all such "problematic" variables v until we arrive at a post- 
unique problem instance. □ 

Proposition 2 There are UB problem instances with a tree causal graph that are not single- 
valued, thus TreeUB UBS. 

Proof: The proof of Proposition 2 is straightforward: Consider a variable G V, T>{v) = 
{v' ,v"}, such that succ(f) = {u,w}. It can be the case that any value change of u will be 
prevailed by v' , while any value change of w will be prevailed by v" . Therefore, restricting 
causal graphs even to trees does not entail single- valuedness.^ □ 

Propositions 1 and 2 show that TreeUB is a polynomial subclass of PUB that is not 
entailed by any tractability results of Backstrom and Nebel (1995). 

Proposition 3 There are UB problem instances with a polytree causal graph that are nei- 
ther single-valued, nor post-unique. 

Proof: The proof is straightforward: Consider a planning problem with a polytree causal 
graph, such that there exist a variable v ^ V with pre6{y) = {u,w}, and the following 
operator set A„: 



pre 


post 


prv 


v' 


v" 


{u',w"} 


v' 


v" 


{u",w'} 


v" 


v' 


W,w'} 


v" 


v' 


{u",w"} 



Clearly, any problem instance with such A^ C A is neither single- valued, nor post- 
unique, since (i) there is more than one operator achieving any value of v, and (ii) both 
values of u (and both values of w) appear in prevail conditions of the operators in A^. Note 



7. Using the simple construction technique from the proof of Proposition 1 it can be shown that restricting 
causal graphs to directed chains only does entails single-valuedness. However, this case is too restrictive. 
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that the maximal indegree of such a polytree can be minimal, i.e. equal to 2. Thus, the 
proposition is valid for any polytree that is not a tree. □ 

Prom Proposition 3 it follows that Theorems 1 and 4 introduce new polynomial and 
NP-easy subclasses of the UB problem class, respectively. 

6.3 Structural Restrictions in Prepositional Planning 

Jonsson and Backstrom (1998b) present the 3S class of planning problems. This class is most 
closely related to the problems examined in this paper, since it defines a special subclass of 
problems with binary variables, unary operators and acyclic causal graphs. The 3S problem 
class is defined by posing some additional, relatively severe, restrictions on the problem's 
operator set: Each variable f in a 3S problem instance is required to be either (i) static, 
i.e., unchangeable; (ii) symmetrically reversible, i.e., for each operator A affecting v, there 
exist an operator A' affecting v with the same prevail conditions and the opposite effect; 
or (iii) splitting. For the formal definition of the splitting property we refer to Jonsson 
and Backstrom (1998b). Informally, if a binary variable v is splitting then the problem 
instance can be split into three, well-defined subproblems that can be solved independently. 
For this class of planning problems it was shown that plan existence can be determined in 
polynomial time, while plan generation is provably intractable, since there are instances of 
3S with exponentially long minimal solutions. In particular, the problem instance that we 
used in the proof of Theorem 3 is in 3S. 

The complexity analysis by Jnonsson and Backstrom (1998b) is somewhat unique in 
the research on complexity of propositional planning, since, to the best of our knowledge, 
this was the only attempt to exploit not only syntactical restrictions on the operator set, 
but also some structural restrictions on interaction between the variables. Our analysis 
can be seen as continuing this direction by looking on the structural restrictions only. We 
believe that eliminating the marginal effect of the problem structure on the problem's (po- 
tential) hardness will allow us to understand better the connection between the component 
interactions topology, and the potential complexity of the problem. 

6.4 Structural Restrictions in Multi-valued Formalisms 

When the variables are no longer propositional, some additional properties of the problems 
can be identified, and, possibly, exploited. In particular, additional internal structures of 

the problem can be analysed. 

Jonsson and Backstrom (1998a) analyze different properties of a multi-valued problem 
structure, which is called the domain transition graph. Such a structure is defined for each 
state variable of the problem, and it describes possible transitions between different values 
of this variable. The domain transition graph of a state variable ?; is a directed labeled 
graph Gv = (y,E), where V is associated with the v's set of possible values, T>{v), and 
{x, A,y) & E if and only if the operator A can be applied at some state in which v = x, and 
its application results in a state in which v = y holds. 

Jonsson and Backstrom identify sets of structural restrictions on domain transition 
graphs which make planning instances tractable. Roughly, the properties arc the following: 
(1) The problem domain is interference-safe, i.e., each operator is either unary or irreplace- 
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able with respect to every variable it affects. An operator A is irreplaceable with respect 
to a variable v if the removal of all edges from that stem from A disconnects some 
weakly connected component of Gy. (2) For every variable v, the graph G„, restricted to 
the set of values that appear in the prevail conditions of some operators, is acyclic. (3) Any 
sequence of operators annotating a path from a; to y in the domain transition graph of v, is 
stronger than all shortest such sequences connecting x and y. Here, a sequence Ai, . . . , A^. 
is stronger than A[, . . . , A'^ if there is a subsequence Ai-^ , ■ ■ ■ , A^ of , . . . , such that for 
every 1 < j < Z, the prevail conditions of Aj are a subset of the prevail conditions of Ai^. 
Jonsson and Backstrom present a map of the computational complexity of problems with 
different restrictions, displaying the frontier between the tractable and intractable cases. 

Each domain transition graph combines and structures the influence of many operators 
on a particular variable. Therefore, they provide us a more global picture than the operator 
set alone. Hence, in spite of the fact that domain transition graphs do not capture the rela- 
tionship between different variables, they do allow us to express some structural properties 
that address interactions between the variables (e.g., see property (2) above). 

Observe that domain transition graphs are not very informative in the case of proposi- 
tional planning, since they are only distinguish between the variables that can be changed 
only in one direction and the variables that can be changed in both directions. Although 
this property of domain transition graphs allows to distinguish between the polynomial 
planning with only positive postconditions, and the PSPACE-complete planning with both 
positive and negative postconditions (Bylander, 1994), it seems to be not very helpful in 
further hierarchical refinement of the propositional planning complexity. On the other hand, 
there is no a priori reason why the causal graphs will not be informative in the multi- valued 
case. Exploiting the properties of causal graphs, together with the properties of domain 
transition graphs, seems to be a natural direction to extend the work presented in this paper. 
The recent work of Domshlak and Dinitz (2001) on multi-entity off-line coordination can be 
seen as investigating connections between the structure of the causal graph, together with 
the properties of the domain transition graphs, and the complexity of the corresponding 
problems in case of multi-valued domains. To the best of our knowledge, this is the only 
work that was done with respect to such a "mixed" structural analysis, and a lot of work 
remains to be done. For instance, combining various properties of the domain transition 
graphs studied by Jonsson and Backstrom (1998a), with the properties of the problem's 
causal graph is a direction for the further research. 

7. Summary and Future Work 

We have shown that the form of the causal graph for strips planning problems with unary 
operators is an important factor in determining the computational complexity of plan gen- 
eration. In particular, we have shown that a polynomial time algorithm exists for any 
problem with a polytree causal graph and the node indegree bounded by a constant. More 
generally, this result shows that planning with polytree causal graphs is at most (what is 
often referred to in the Bayes nets literature as) locally exponential, i.e., it is exponen- 
tial in the maximal number of parents of a node. Note that in hardware-control planning 
problems the maximal node indegree is expected to be small, since prevail dependencies 
between the variables reflect the direct interconnections between the corresponding hard- 
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ware components. Likewise we have shown that for a problem with directed-path singly 
connected causal graph the maximal plan length is a low order polynomial, but the problem 
is NP-complete. More generally, we have shown a relation between the number of paths be- 
tween variables in the causal graph and the computational complexity of the corresponding 
planning problem. Finally we have presented the impact of our results on the question of 
complexity of planning problems with serializable subgoals, and connected our work with 
previous results on planning complexity. 

Our work leaves a number of open questions with respect to purely syntactical, and a 
mixture of both structural and syntactical restrictions on the planning problems with unary 
operators. In the former case, one of the most important directions is a further analysis of 
causal graphs with constantly bounded node indegree. It turns out that complexity anal- 
ysis for this class of problems will be very helpful in understanding various computational 
properties of CP-nets (Boutilier et al., 1999). Although here we provided a partial answer 
for this question, the general picture of the worst-case complexity for this class of problems 
is not clear. For example, if the indegree of the causal graph is known to be bounded by 
2, and this is the only structural property of the causal graph, it is even not clear whether 
this problem subclass is in NP. 

In the latter case, various syntactical restrictions can be analysed together with the form 
of the causal graph. For example, one may be interested in the computational properties 
of the problems with acyclic causal graphs, and the restriction that every operator has at 
most a prevail conditions, where a is bounded by a constant. This, as well as many other 
related questions with respect to various special cases of planning with unary operators are 
of interest for the future work. 
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Appendix A. A Short Review of POP, Causal Links and Threats 

We represent a plan as a tuple: (.4, 0,C), where .4 is a set of unary operators, O is a set of 
ordering constraints over A, and C\s a set of causal links. For example, if ^ = {^i, ^3} 
then O might be the set {Ai < A^,A2 < A-^}. These constraints specify a plan in which 
^3 is necessarily the last operator, but do not commit to a particular order on Ai and A2. 
Naturally, the set of ordering constraints must be consistent, i.e., there must exist some 

total order satisfying them. A causal link has the form Ap Ac, where Ap and Ac are 
operators and 'di is a possible value for some propositional variable Vi. It denotes the fact 
that Ap produces (i.e., has the postcondition) Vi = -di which is consumed by Ac (i.e., used to 
satisfy a pre- or prevail-condition of Ac). Causal links help us detect whether one operator 
At interferes with the work done to enable the execution of some other operator Ac. In that 

case, At is said to constitute a threat to one of A'^s causal links. Formally, suppose that 

1?- 

{A, O, C) is a plan, and Ap ^ is a causal link in C Let At be a different operator in A. 
We say that At threatens Ap — ^ Ac when the following two criteria are satisfied: 

• OU {Ap < At < Ac} is consistent, and 

• At has as an effect. 

When a partial order plan P contains threats, it is possible that the goal will not be 
achieved by some (or all) of the total order plans consistent with P's ordering constraints. 
To prevent this, the plan generator must check for threats and remove them by adding one 
of two possible ordering constraints: At < Ap (demotion) or Ac < At (promotion). 

A tutorial introduction to POP algorithms can be found in (Weld, 1994). POP is a 
regressive framework for partial order planning that starts with the null plan and continu- 
ously updates it by inserting new actions and removing threats. This process continues until 
the precondition and the prevail conditions of every operator in the plan are supported by 
some causal link and no threats exist. The first argument to POP is a plan and the second 
argument is an agenda of goals that need to be supported by causal links. Each item on the 
agenda is represented by a pair {"^ijA) where is either pre- or prevail condition of a plan 
action A. The last argument to POP is the whole collection of the operators defined by 
the planning instance. The initial call to POP contains the null plan, a specially initialized 
agenda, and the operator set A of the given problem. 

In this paper we introduce a specialized, deterministic POP algorithm that starts the 
planning process using a variant of the null plan which encodes the planning problem. In 
particular, if the planning instance has . . . ,f* as the goal then the corresponding null 
plan has exactly 2n dummy unary operators, A = {A^, . . . , A^, AI, . . . , A"^}, n ordering 
constraints, O = {{A^ < AI}, . . . , {A'^ < A*}}, and no causal links, C = {}. For every 
Vi G V, A^ is the corresponding "start" operator - it has neither pre- nor prevail conditions, 
and its effect specifies the value of the variable Vi in the initial state, which is denoted by 
v^. Similarly, A* is the "end" operator - it has no effect, no prevail conditions, but its 
precondition is set to the value of Vi in the goal state, which in turn is denoted by v*.^ 

8. Actually, the goal state may not specify the values of all the variables, thus the number of the end 
operators can be less than n. However, for clarity of presentation, we leave this definition of the null 
plan. 
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Our description of the null plan is modified from that of Weld (1994) to better suit the 
restriction to unary operators. Likewise, the initial call to our POP algorithm contains the 
agenda {(^i*, ^t), ...,«, ^*)}. 
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Appendix B. Proofs and Auxiliary Results 

Lemma 2 // forward-check was successful then pop-PCG will return a valid plan. 

Proof: The lemma will follow from the following claims: 

1. For every agenda item, there exists an operator that has it as an effect. 

2. There are no threats in the output of POP-PCG. 

3. The ordering constraints in O are consistent. 

4. The agenda will be empty after a polynomial number of steps. 

(1+4) The first claim follows from the success of the FORWARD-CHECK procedure. 
FORWARD-CHECK implies that for any ul G <j{vi) there is an operator instance A{yl) G 
Vi U {A^}. Therefore, if vl G (j{vi) then the existence of an appropriate Aadd is promised. 

Assume to the contrary that ^ cr{vi) and, without loss of generality, assume that this 
is the first iteration that it happens. If so, then for each variable u E succ(u.j), there is no 
edge labeled by in the graph G'{u), which is created by the FORWARD- CHECK. From this 
follows that Aneed Cannot have as a prevail condition, and thus A^eed has to affect the 
variable Vi itself. In this case either Aneed = A{uf'^^) or Aneed = A^. 

Consider the former case: If Aneed = A{ij'^^) then ij'^^ was previously selected from the 
agenda. By our assumption it means that vf~^^ G (j{v,i), and it contradicts our assumption 
that ^ cr{vi) since uj is a predecessor of vf^^ on a{vi). 

Now consider the last option that Aneed = A*. If Aadd = Aluj) then the goal value of 
the variable Vi is consistent with lyf, and A{irl ^) G ^ (see Step 3(b)ii). If A{i/j ^) £ A 
then i^'^ was previously selected from the agenda. By our assumption it means that 
lyf^ G 17 (vi). However, it contradicts our assumption that i^f ^ a{vi) since a{vi), by 
definition, terminates with a node consistent with the goal value of Vi. 

In addition, since we have shown that the only operators added into A are those from 
Ti U {A*-, A*}, for 1 < i < n, the agenda will be empty after O(n^) steps. 

(2) Suppose that some operator At threatens Ap Ac, i.e., 

• OU {Ap < At < Ac} is consistent, and 

• At has -i-di as an effect. 

For a given variable Vi, POP-PCG forces the operators affecting Vi as follows (Step 4): 

^0 = A{iyl) < A{vf) <...< A{vf), x < FMaxReq(i;i) (4) 

Thus Ac can only be an operator with t?j as a prevail condition. Note that Ap and At 
affect the same variable Vi. In (1) we already showed that 'di = vf G o"(fj). In that 
case At = A{v^^, I > j. However, if is a prevail condition of Ac then the ordering 
constraint {Ac < A{vj^^)} was added to O at Step 6. If so then from Eq. 4, it follows that 
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{A{ul) < A{i4^^),A{i4^^) < A{i^l)}, I > j, will be implied by O U {Ap < At < A^}, and it 
contradicts with the assumption that O U {Ap < At < Ac} is consistent. 

(3) The ordering constraints are consistent if no two operators Ai and Aj are such that 
O implies {{Ai < Aj}, {Aj < Ai}}. In what follows, Ai will be used to denote an arbitrary 
operator affecting variable Vi. 

First note that each ordering constraint added in Step 4 or Step 6 is cither between 
operators affecting the same variable or between operators affecting a variable and its child 
(with respect to the causal graph). In particular, if Ai < Aj was added in Step 4 then either 
Vi = Vj or Vi G pred(vj), whereas if Ai < Aj was added in Step 6 then Vj G pred{vi). 

Assume, to the contrary that O implies Ai < Aj and Aj < Ai. From the argument 
above, we know that there is a, possibly empty, path between Vi and Vj in the undirected 
graph induced by the causal graph. By our structural assumption, we know that such an 
undirected path between Vi and Vj is unique, and thus the situation is as follows: We have 
two chains of operators 

a : Ai = A}^ < . . . < Af° < Al < ... < Af'^ < < A} < ...< ™ = A^ 

P : Ai = A\^>...>Af^>A\^> ... >1^> >A]^>...>Ay2=A, 



such that, for < fc < m, both > 1 and yk > 1. The corresponding unique undirected 
path between Vi and Vj is: 

Vi = Vi^-Vi^- Vi^_^ ■ Vi^ = Vj 

Without loss of generality, the internal elements of a and (5 are disjoint. Otherwise, if there 
is an operator B that belongs to the internal parts of both a and (3 then we can reduce 
these chains and deduce Ai < B and Ai > B. 
The proof of consistency is as follows: 

(a) We prove that if such a and /3 exist then at least one of them should have at least 
one internal element. 

(b) We show some useful property of a and /?, that is exploited in (c). 

(c) We show that for < A; < m, Af' and are different except when xq = yo = 1. 
Note that (a) together with (c) contradicts our assumption that A^^ = AJ^ . 

(a) Assume, to the contrary, that both a and /? do not contain any internal elements. 
If so, then the algorithm actually adds to O ordering constraints Ai < Aj and Aj < Ai. 
If Vi and Vj are the same variable then Ai < Aj can only stem from Step 4 and only 
because Ai has the precondition of Aj as its effect. However, by definition of forward- 
check, Aj can not have the same role w.r.t. Ai and thus it is impossible that Aj < Ai was 
added to O. Alternatively, if Vi is a parent of Vj then Ai < Aj can stem only from Step 4 
because Ai has the prevail condition of Aj as its effect. Suppose that Ai = ^(b;-) and thus 
€ prv{Aj). In turn, Aj < Ai can be added only in Step 6 because Aj has the precondition 
of Ai as the prevail condition. But if bj G pr\/(Aj) then Ai = A{vj-f), and this contradicts 
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our assumption that Ai = A(b^). Alternatively, we can assume that Ai = ^(wj) but this 
situation is completely symmetric, and thus the result will be the same. Hence we proved 
that either a or /? have to contain at least one internal element. In particular it means 
that the next to last elements of a and (3 are different and this fact is exploited later in the 
proof. 

(b) Consider subchains of a that consist of operators affecting only one particular vari- 
able. For each such a subchain, i.e. for < fc < m, for 1 < j < Xk — 1, the ordering 
constraint Aj < Aj^^ can only stem from Step 4 because A-^ has the precondition of A-'-'^^ 

as its effect. Thus, post(Aj^) = pre(yl]^^). Similarly, for the subchains of /3, for < A; < m 
and for 2 < j < y^, post(^]^) = pre(A^~^). In what follows we denote this property of a 

and f3 by local m,onotonicity. 

(c) First suppose that either xq > 1 or yo > Ij or both. Consider the following sequence: 

Prom the local monotonicity, the construction of FORWARD-CHECK, and the fact that I7I > 2 
it follows that post(^^g ) appears before posX.{A^°) on the maximal sequence cr^.^ . Continuing 

with the next variable Vi^ we claim that post(^^^ ) has to appear before \>ost{A^^) on ay^^. 

(i) li is a parent of vi^ then A^° < Aj^ can only stem from Step 4 because A^° has 
the prevail condition of Aj_^ as its effect. In turn, A^^^ > Aj_^ can only stem from Step 6 
because the prevail condition of Aj^ is the precondition of From the relation between 
A^° and A^°, and the construction of G'^ivi^) in FORWARD-CHECK, it follows that post{Al^) 
appears before post{Aj_^) on a{vi^). Subsequently, from the local monotonicity it follows 
that post(A^^^) appears before post(^^^^) on a{vi-^). 

(a) Similarly, if vi^ is a parent of vi^ then Af^ > A\^ can only stem from Step 4 because 
A\_^ has the prevail condition of as its effect, and A^° < Aj^ can only stem from Step 6 
because the prevail condition of A^'^ is the precondition of Aj^. Prom the relation between 
A^^^ and Af", and the construction of G'^{vi^) in FORWARD-CHECK, it follows that post(A,- J 
appears before post(A|^) on a{vi-i), and again, from the local monotonicity it follows that 
post(^^j^) appears before post(A^^^) on a{vi-^). 

Alternatively, if xq = yo = 1 then A^° = A^° = A^. Prom (a) immediately follows 
that Aj^ / Aj_^, and an analysis similar to the above shows that post(^|'^^) appears before 
post(A^p on a{vi^). 

Having established that post(^^^^) appears before post(A^^^) on it is apparent 

that an inductive argument will allow us to show that for all A; > we have that postf^^*") 

appears before post(yl,-'') on a{vi^). Note that in particular it means that the operators 
A^'' and Ay'° are different, and this contradicts our assumption that Af"' = A^"" . □ 
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Theorem 2 Plan existence for strips planning problems with unary operators and 
directed-path singly connected causal graph is TS!P -complete. 

Proof: First we show the membership in NP. Let MinPlanSize(n) denote the size of the 
minimal plan for a problem instance 11. Using the MaxReq property of the state variables, 
the following upper bound for MinPlanSize(n) is straightforward from the Lemma 1: 

MinPlanSize(n) < ^ MaxReq('y) < (5) 

vev 

Thus, if we guess a minimal solution for a given solvable problem, we can verify it in low 
polynomial time. 

The proof of the hardness is by polynomial reduction from 3-SAT to the corresponding 
propositional plan generation problem with a directed-path singly connected causal graph. 
3-SAT is the problem of finding a satisfying assignment for a propositional formula in con- 
junctive normal form in which each conjunct (clause) has at most three literals. 

Let ^ = Ci A. . . ACn be a propositional formula belonging to 3-SAT, and let Xi, . . . , Xm 
be the variables used in An equivalent propositional planning problem with a directed- 
path singly connected causal graph can be constructed as follows: The variable set is V = 
{Xi,Xi, . . . , Xm, Xm} U {Ci, . . . ,Cn}- The variables X^ and Xi have no predecessors in 
the causal graph, thus pred(Xi) = pred{Xi) = {0}. In turn, for 1 < ? < n, pred(Q) = 
{Xi^,Xij^,Xi^,Xi2,Xig,Xig}, where Xi^,Xi^, and Xi^ are the variables that participate in 
the ith clause of T. Finally, Init and Goal consist of false and true assignments to all 
variables in V, respectively. 

Let every operator A G A be presented as a three-tuple ({pre}, {post}, {prv}) of pre, post, 
and prevail conditions respectively. Then, the corresponding operator set A is specified as 
follows: 

Ax, ={ ({/},{*},{}) } 
Ax, ={ ({/},{*},{}) } 

Aq ={ ({/},{*},{«!}), ({/},{*}, 14}), ({/},{*}, {4}> } 

where a*- (1 < j < 3) corresponds to the truth assignment on the variable X^. that satisfies 
the ith. clause of T. Let d = {Xi V X2 V X^). Then a\ = {Xi =t,Xi = /}, = {X2 = 
/, X2 = t}, and al = {Xs = t,Xs = /}. 

To illustrate the proposed reduction consider the following example. Formula consist 
of 3 clauses: (xi V X2 V X3) A {xi V X2 V X4) A {x2 V X3 V X4). The causal graph of the 
corresponding planning problem is as follows: 
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This is a prepositional planning problem with single-effect operators and an underlying 
directed-path singly connected causal graph. Clearly, Goal is reachable (11 is solvable) if and 
only if a satisfying assignment for can be found. Thus, plan existence for the propositional 
planning problems with directed-path singly connected causal graphs is NP-complete. □ 



Lemma 4 Given k ordered sequences ai,--- ,ak of n elements each, the number T[k] of 
different merges o/ ui, • ■ ■ ,ak, preserving the orderings induced by ai, - ■ ■ , cr^ on their ele- 
ments, is given by: 

Proof: Considering the merge operation of such k sequences as iterative merge of ai, 
2 < i < k, with the already merged sequences cti, . . . ,(Tj_i, it is easy to see that T{k) can 
be expressed as: 

= iT[k-l]-S[n{k-l),n], k>l 
[1) k = 1 

where S[x,y] stands for the number of different, order preserving merges of two ordered 
sequences of sizes x and y (without loss of generality, we assume that x >y). 

We consider the process of merging two ordered sequences a and a', \a\ > \a'\, as: 

(i) partition of a' into j sub-sequences, 

(ii) partition of a into / sub-sequences, where j — l<l<j + l, and 

(iii) interleaving and order preserving concatenation of the sub-sequences of a and a'. 

First, observe that a' can be partitioned into 1 < j < \a'\ sub-sequences. Second, for 
any j, the numbers of different partitions corresponding to steps (i) and (ii) are ('"^1^^) and 

(''^i^^)' ^^spectively. Finally, given a pair of such partitions of a and a', there exist exactly 
one possible interleaving and order preserving concatenation as in step (iii). Therefore, we 
have: 

^(^..)^i:C::)Cr) 

and combining Eq. 7 with Eq. 8, we arrive to Eq. 6. □ 
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